Word Sense Disambiguation Based on Weight Distribution Model with Multiword Expression
نویسندگان
چکیده
This paper proposes a two-phase word sense disambiguation method, which filters only the relevant senses by utilizing the multiword expression and then disambiguates the senses based on Weight Distribution Model. Multiword expression usually constrains the possible senses of a polysemous word in a context. Weight Distribution Model is based on the hypotheses that every word surrounding a polysemous word in a context contributes to disambiguating the senses according to its discrimination power. The experiments on English data in SENSEVAL-1 and SENSEVAL-2 show that multiword expression is useful to filter out irrelevant senses of a polysemous word in a given context, and Weight Distribution Model is more effective than Decision Lists.
منابع مشابه
Improving Word Translation Disambiguation by Capturing Multiword Expressions with Dictionaries
The paper describes a method for identifying and translating multiword expressions using a bi-directional dictionary. While a dictionarybased approach suffers from limited recall, precision is high; hence it is best employed alongside an approach with complementing properties, such as an n-gram language model. We evaluate the method on data from the English-German translation part of the crossl...
متن کاملMultiwords and Word Sense Disambiguation
This paper studies the impact of multiword expressions on Word Sense Disambiguation (WSD). Several identification strategies of the multiwords in WordNet2.0 are tested in a real Senseval-3 task: the disambiguation of WordNet glosses. Although we have focused on Word Sense Disambiguation, the same techniques could be applied in more complex tasks, such as Information Retrieval or Question Answer...
متن کاملLIHLA: A lexical aligner based on language-independent heuristics
Alignment of words and multiword units plays an important role in many natural language processing applications, such as example-based machine translation, transfer rule learning for machine translation, bilingual lexicography, word sense disambiguation, etc. In this paper we describe LIHLA, a lexical aligner which uses bilingual probabilistic lexicons generated by a freely available set of too...
متن کاملEvaluating the LIHLA lexical aligner on Spanish, Brazilian Portuguese and Basque parallel texts
Alignment of words and multiword units plays an important role in many natural language processing applications, such as example-based machine translation, transfer rule learning for machine translation, bilingual lexicography, word sense disambiguation, etc. In this paper we describe LIHLA, a lexical aligner which uses bilingual probabilistic lexicons generated by a freely available set of too...
متن کاملرفع ابهام معنایی واژگان مبهم فارسی با مدل موضوعی LDA
Word sense disambiguation is the task of identifying the correct sense for the word in a given context among a finite set of possible sense. In this paper a model for farsi word sense disambiguation is presented. The model use two group of features: first, all word and stop words around target word and topic models as second features. We extract topics from a farsi corpus with Latent Dirichlet ...
متن کامل